Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 8859806 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 743.5 MiB |
| Average record size in memory | 88.0 B |
Variable types
| Numeric | 11 |
|---|
LongitudAcc is highly correlated with Engine Load and 1 other fields | High correlation |
EngineSpeed is highly correlated with EngineAirInletPressure and 2 other fields | High correlation |
Fuel Rate is highly correlated with Engine Load and 2 other fields | High correlation |
Engine Load is highly correlated with Boost Pressure and 2 other fields | High correlation |
Boost Pressure is highly correlated with Engine Load and 2 other fields | High correlation |
EngineAirInletPressure is highly correlated with EngineSpeed and 3 other fields | High correlation |
AcceleratorPedalPos is highly correlated with EngineSpeed and 3 other fields | High correlation |
VehicleSpeed is highly correlated with EngineSpeed | High correlation |
BrakePedalPos is highly correlated with AcceleratorPedalPos | High correlation |
Fuel Rate is highly skewed (γ1 = 52.46319414) | Skewed |
Timestamp has unique values | Unique |
LongitudAcc has 2083141 (23.5%) zeros | Zeros |
EngineSpeed has 175858 (2.0%) zeros | Zeros |
Fuel Rate has 2091514 (23.6%) zeros | Zeros |
Engine Load has 2101609 (23.7%) zeros | Zeros |
Boost Pressure has 415773 (4.7%) zeros | Zeros |
AcceleratorPedalPos has 3574373 (40.3%) zeros | Zeros |
VehicleSpeed has 1260521 (14.2%) zeros | Zeros |
BrakePedalPos has 7210811 (81.4%) zeros | Zeros |
Reproduction
| Analysis started | 2022-11-23 15:56:39.345952 |
|---|---|
| Analysis finished | 2022-11-23 16:07:06.873871 |
| Duration | 10 minutes and 27.53 seconds |
| Software version | pandas-profiling v3.4.0 |
| Download configuration | config.json |
| Distinct | 8859806 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.683501885 × 1010 |
| Minimum | 4.757909408 × 1010 |
|---|---|
| Maximum | 1.114190064 × 1011 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 67.6 MiB |
Quantile statistics
| Minimum | 4.757909408 × 1010 |
|---|---|
| 5-th percentile | 4.942679251 × 1010 |
| Q1 | 5.857681099 × 1010 |
| median | 6.704562371 × 1010 |
| Q3 | 7.504463751 × 1010 |
| 95-th percentile | 8.330114331 × 1010 |
| Maximum | 1.114190064 × 1011 |
| Range | 6.383991232 × 1010 |
| Interquartile range (IQR) | 1.646782652 × 1010 |
Descriptive statistics
| Standard deviation | 1.054857865 × 1010 |
|---|---|
| Coefficient of variation (CV) | 0.157830114 |
| Kurtosis | -0.8350820478 |
| Mean | 6.683501885 × 1010 |
| Median Absolute Deviation (MAD) | 8289939704 |
| Skewness | 0.02679453497 |
| Sum | 5.92145301 × 1017 |
| Variance | 1.112725115 × 1020 |
| Monotonicity | Strictly increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 4.757909408 × 1010 | 1 | < 0.1% |
| 7.263572794 × 1010 | 1 | < 0.1% |
| 7.263574992 × 1010 | 1 | < 0.1% |
| 7.263574808 × 1010 | 1 | < 0.1% |
| 7.263574695 × 1010 | 1 | < 0.1% |
| 7.263574506 × 1010 | 1 | < 0.1% |
| 7.263574394 × 1010 | 1 | < 0.1% |
| 7.263574291 × 1010 | 1 | < 0.1% |
| 7.263574096 × 1010 | 1 | < 0.1% |
| 7.2635738 × 1010 | 1 | < 0.1% |
| Other values (8859796) | 8859796 |
| Value | Count | Frequency (%) |
| 4.757909408 × 1010 | 1 | |
| 4.757909524 × 1010 | 1 | |
| 4.757909633 × 1010 | 1 | |
| 4.757909708 × 1010 | 1 | |
| 4.757909824 × 1010 | 1 | |
| 4.757909908 × 1010 | 1 | |
| 4.75791001 × 1010 | 1 | |
| 4.757910113 × 1010 | 1 | |
| 4.757910227 × 1010 | 1 | |
| 4.757910413 × 1010 | 1 |
| Value | Count | Frequency (%) |
| 1.114190064 × 1011 | 1 | |
| 1.114190052 × 1011 | 1 | |
| 1.114190041 × 1011 | 1 | |
| 1.114190035 × 1011 | 1 | |
| 1.114190023 × 1011 | 1 | |
| 1.114190011 × 1011 | 1 | |
| 1.114190004 × 1011 | 1 | |
| 1.114189993 × 1011 | 1 | |
| 1.114189982 × 1011 | 1 | |
| 1.114189975 × 1011 | 1 |
WetTankAirPressure
Real number (ℝ≥0)
| Distinct | 196 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.23041809 |
| Minimum | 0 |
|---|---|
| Maximum | 13.44525 |
| Zeros | 21990 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 67.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 9.99775 |
| Q1 | 10.82515 |
| median | 11.37675 |
| Q3 | 11.8594 |
| 95-th percentile | 12.34205 |
| Maximum | 13.44525 |
| Range | 13.44525 |
| Interquartile range (IQR) | 1.03425 |
Descriptive statistics
| Standard deviation | 1.130928062 |
|---|---|
| Coefficient of variation (CV) | 0.1007022226 |
| Kurtosis | 34.95526587 |
| Mean | 11.23041809 |
| Median Absolute Deviation (MAD) | 0.48265 |
| Skewness | -4.538502418 |
| Sum | 99499325.54 |
| Variance | 1.278998283 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 11.23885 | 358789 | 4.0% |
| 11.1699 | 353211 | 4.0% |
| 11.37675 | 348562 | 3.9% |
| 11.4457 | 347726 | 3.9% |
| 11.51465 | 341676 | 3.9% |
| 11.5836 | 337871 | 3.8% |
| 11.7215 | 326389 | 3.7% |
| 11.79045 | 324266 | 3.7% |
| 11.10095 | 315160 | 3.6% |
| 11.8594 | 314491 | 3.5% |
| Other values (186) | 5491665 |
| Value | Count | Frequency (%) |
| 0 | 21990 | |
| 0.06895 | 345 | < 0.1% |
| 0.1379 | 250 | < 0.1% |
| 0.20685 | 216 | < 0.1% |
| 0.2758 | 561 | < 0.1% |
| 0.34475 | 162 | < 0.1% |
| 0.4137 | 188 | < 0.1% |
| 0.48265 | 155 | < 0.1% |
| 0.5516 | 123 | < 0.1% |
| 0.62055 | 161 | < 0.1% |
| Value | Count | Frequency (%) |
| 13.44525 | 1 | < 0.1% |
| 13.3763 | 8 | < 0.1% |
| 13.30735 | 20 | < 0.1% |
| 13.2384 | 49 | < 0.1% |
| 13.16945 | 124 | < 0.1% |
| 13.1005 | 234 | < 0.1% |
| 13.03155 | 659 | < 0.1% |
| 12.9626 | 1694 | < 0.1% |
| 12.89365 | 4128 | |
| 12.8247 | 5858 |
| Distinct | 125 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.03670183072 |
| Minimum | -7.6 |
|---|---|
| Maximum | 13 |
| Zeros | 2083141 |
| Zeros (%) | 23.5% |
| Negative | 3701960 |
| Negative (%) | 41.8% |
| Memory size | 67.6 MiB |
Quantile statistics
| Minimum | -7.6 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -0.2 |
| median | 0 |
| Q3 | 0.2 |
| 95-th percentile | 0.8 |
| Maximum | 13 |
| Range | 20.6 |
| Interquartile range (IQR) | 0.4 |
Descriptive statistics
| Standard deviation | 0.5439853105 |
|---|---|
| Coefficient of variation (CV) | -14.82174867 |
| Kurtosis | 73.02868179 |
| Mean | -0.03670183072 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | 2.618388325 |
| Sum | -325171.1 |
| Variance | 0.295920018 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 2083141 | |
| -0.1 | 870215 | |
| -0.2 | 748469 | 8.4% |
| 0.1 | 732215 | 8.3% |
| 0.2 | 552313 | 6.2% |
| -0.3 | 535586 | 6.0% |
| 0.3 | 410283 | 4.6% |
| -0.4 | 368754 | 4.2% |
| 0.4 | 296249 | 3.3% |
| 0.5 | 236991 | 2.7% |
| Other values (115) | 2025590 |
| Value | Count | Frequency (%) |
| -7.6 | 1 | < 0.1% |
| -7.5 | 1 | < 0.1% |
| -7.3 | 1 | < 0.1% |
| -7.2 | 1 | < 0.1% |
| -7.1 | 1 | < 0.1% |
| -7 | 1 | < 0.1% |
| -6.8 | 1 | < 0.1% |
| -6.5 | 2 | |
| -6.1 | 1 | < 0.1% |
| -6 | 3 |
| Value | Count | Frequency (%) |
| 13 | 1610 | |
| 12.9 | 309 | < 0.1% |
| 5.4 | 1 | < 0.1% |
| 5.2 | 2 | < 0.1% |
| 5.1 | 4 | < 0.1% |
| 5 | 4 | < 0.1% |
| 4.9 | 6 | < 0.1% |
| 4.8 | 10 | < 0.1% |
| 4.7 | 19 | < 0.1% |
| 4.6 | 30 | < 0.1% |
| Distinct | 13234 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1069.539 |
| Minimum | 0 |
|---|---|
| Maximum | 8191.875 |
| Zeros | 175858 |
| Zeros (%) | 2.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 67.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 595.125 |
| Q1 | 893 |
| median | 1156.125 |
| Q3 | 1284.125 |
| 95-th percentile | 1457.875 |
| Maximum | 8191.875 |
| Range | 8191.875 |
| Interquartile range (IQR) | 391.125 |
Descriptive statistics
| Standard deviation | 324.8017937 |
|---|---|
| Coefficient of variation (CV) | 0.3036839178 |
| Kurtosis | 8.085599775 |
| Mean | 1069.539 |
| Median Absolute Deviation (MAD) | 157.75 |
| Skewness | -0.5584022858 |
| Sum | 9475908045 |
| Variance | 105496.2052 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 175858 | 2.0% |
| 600.125 | 42914 | 0.5% |
| 600 | 42852 | 0.5% |
| 600.25 | 42648 | 0.5% |
| 599.875 | 41716 | 0.5% |
| 600.375 | 41183 | 0.5% |
| 599.75 | 40929 | 0.5% |
| 600.5 | 39257 | 0.4% |
| 599.625 | 38892 | 0.4% |
| 599.5 | 36892 | 0.4% |
| Other values (13224) | 8316665 |
| Value | Count | Frequency (%) |
| 0 | 175858 | |
| 22.125 | 1 | < 0.1% |
| 27.625 | 1 | < 0.1% |
| 31.875 | 1 | < 0.1% |
| 37 | 1 | < 0.1% |
| 41.75 | 1 | < 0.1% |
| 42.25 | 1 | < 0.1% |
| 48.25 | 1 | < 0.1% |
| 48.75 | 1 | < 0.1% |
| 49 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8191.875 | 281 | |
| 2252.5 | 1 | < 0.1% |
| 2246.875 | 1 | < 0.1% |
| 2245.625 | 1 | < 0.1% |
| 2217.5 | 1 | < 0.1% |
| 2208.5 | 1 | < 0.1% |
| 2200.25 | 1 | < 0.1% |
| 2186.25 | 1 | < 0.1% |
| 2184.125 | 1 | < 0.1% |
| 2182.875 | 1 | < 0.1% |
| Distinct | 1104 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.50851436 |
| Minimum | 0 |
|---|---|
| Maximum | 3876.198645 |
| Zeros | 2091514 |
| Zeros (%) | 23.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 67.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.887205 |
| median | 7.570816 |
| Q3 | 20.287421 |
| 95-th percentile | 46.962718 |
| Maximum | 3876.198645 |
| Range | 3876.198645 |
| Interquartile range (IQR) | 19.400216 |
Descriptive statistics
| Standard deviation | 70.25923101 |
|---|---|
| Coefficient of variation (CV) | 4.842620635 |
| Kurtosis | 2879.783051 |
| Mean | 14.50851436 |
| Median Absolute Deviation (MAD) | 7.570816 |
| Skewness | 52.46319414 |
| Sum | 128542622.6 |
| Variance | 4936.359542 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 2091514 | 23.6% |
| 3.312232 | 108457 | 1.2% |
| 3.371379 | 98713 | 1.1% |
| 3.253085 | 95916 | 1.1% |
| 3.430526 | 71520 | 0.8% |
| 3.193938 | 66671 | 0.8% |
| 3.962849 | 64099 | 0.7% |
| 3.903702 | 59499 | 0.7% |
| 4.021996 | 58611 | 0.7% |
| 4.081143 | 49824 | 0.6% |
| Other values (1094) | 6094982 |
| Value | Count | Frequency (%) |
| 0 | 2091514 | |
| 0.059147 | 8426 | 0.1% |
| 0.118294 | 8197 | 0.1% |
| 0.177441 | 10419 | 0.1% |
| 0.236588 | 14415 | 0.2% |
| 0.295735 | 12906 | 0.1% |
| 0.354882 | 10573 | 0.1% |
| 0.414029 | 9305 | 0.1% |
| 0.473176 | 8035 | 0.1% |
| 0.532323 | 6572 | 0.1% |
| Value | Count | Frequency (%) |
| 3876.198645 | 2798 | |
| 2861.472713 | 1 | < 0.1% |
| 2848.756108 | 1 | < 0.1% |
| 65.0617 | 20 | < 0.1% |
| 65.002553 | 54 | < 0.1% |
| 64.943406 | 72 | < 0.1% |
| 64.884259 | 85 | < 0.1% |
| 64.825112 | 96 | < 0.1% |
| 64.765965 | 71 | < 0.1% |
| 64.706818 | 87 | < 0.1% |
| Distinct | 202 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.32693887 |
| Minimum | 0 |
|---|---|
| Maximum | 106.5 |
| Zeros | 2101609 |
| Zeros (%) | 23.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 67.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2.5 |
| median | 23.5 |
| Q3 | 42.5 |
| 95-th percentile | 91 |
| Maximum | 106.5 |
| Range | 106.5 |
| Interquartile range (IQR) | 40 |
Descriptive statistics
| Standard deviation | 27.3352885 |
|---|---|
| Coefficient of variation (CV) | 0.9320880237 |
| Kurtosis | 0.236917458 |
| Mean | 29.32693887 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 0.960540188 |
| Sum | 259830989 |
| Variance | 747.2179972 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 2101609 | 23.7% |
| 100 | 283883 | 3.2% |
| 19.5 | 178000 | 2.0% |
| 19 | 167079 | 1.9% |
| 20 | 133139 | 1.5% |
| 23 | 126156 | 1.4% |
| 23.5 | 122324 | 1.4% |
| 18.5 | 113249 | 1.3% |
| 22.5 | 112198 | 1.3% |
| 24 | 104178 | 1.2% |
| Other values (192) | 5417991 |
| Value | Count | Frequency (%) |
| 0 | 2101609 | |
| 0.5 | 37624 | 0.4% |
| 1 | 27249 | 0.3% |
| 1.5 | 20752 | 0.2% |
| 2 | 19525 | 0.2% |
| 2.5 | 17843 | 0.2% |
| 3 | 19029 | 0.2% |
| 3.5 | 18544 | 0.2% |
| 4 | 20337 | 0.2% |
| 4.5 | 19153 | 0.2% |
| Value | Count | Frequency (%) |
| 106.5 | 1 | < 0.1% |
| 100 | 283883 | |
| 99.5 | 8034 | 0.1% |
| 99 | 8782 | 0.1% |
| 98.5 | 10146 | 0.1% |
| 98 | 9035 | 0.1% |
| 97.5 | 8970 | 0.1% |
| 97 | 8570 | 0.1% |
| 96.5 | 8549 | 0.1% |
| 96 | 8623 | 0.1% |
| Distinct | 206 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2505231002 |
| Minimum | 0 |
|---|---|
| Maximum | 1.76669 |
| Zeros | 415773 |
| Zeros (%) | 4.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 67.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.008618 |
| Q1 | 0.060326 |
| median | 0.137888 |
| Q3 | 0.34472 |
| 95-th percentile | 0.887654 |
| Maximum | 1.76669 |
| Range | 1.76669 |
| Interquartile range (IQR) | 0.284394 |
Descriptive statistics
| Standard deviation | 0.2924572663 |
|---|---|
| Coefficient of variation (CV) | 1.167386425 |
| Kurtosis | 4.200984102 |
| Mean | 0.2505231002 |
| Median Absolute Deviation (MAD) | 0.112034 |
| Skewness | 1.99446033 |
| Sum | 2219586.066 |
| Variance | 0.08553125264 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.017236 | 634940 | 7.2% |
| 0 | 415773 | 4.7% |
| 0.008618 | 387450 | 4.4% |
| 0.103416 | 333216 | 3.8% |
| 0.112034 | 315274 | 3.6% |
| 0.094798 | 312851 | 3.5% |
| 0.120652 | 275436 | 3.1% |
| 0.08618 | 264828 | 3.0% |
| 0.025854 | 256636 | 2.9% |
| 0.12927 | 226451 | 2.6% |
| Other values (196) | 5436951 |
| Value | Count | Frequency (%) |
| 0 | 415773 | |
| 0.008618 | 387450 | |
| 0.017236 | 634940 | |
| 0.025854 | 256636 | |
| 0.034472 | 186385 | 2.1% |
| 0.04309 | 154041 | 1.7% |
| 0.051708 | 141626 | 1.6% |
| 0.060326 | 139175 | 1.6% |
| 0.068944 | 161306 | 1.8% |
| 0.077562 | 208530 | 2.4% |
| Value | Count | Frequency (%) |
| 1.76669 | 1 | < 0.1% |
| 1.758072 | 12 | < 0.1% |
| 1.749454 | 7 | < 0.1% |
| 1.740836 | 4 | < 0.1% |
| 1.732218 | 14 | < 0.1% |
| 1.7236 | 10 | < 0.1% |
| 1.714982 | 13 | < 0.1% |
| 1.706364 | 35 | |
| 1.697746 | 52 | |
| 1.689128 | 64 |
| Distinct | 105 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 126.6404867 |
| Minimum | 32 |
|---|---|
| Maximum | 510 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 67.6 MiB |
Quantile statistics
| Minimum | 32 |
|---|---|
| 5-th percentile | 102 |
| Q1 | 108 |
| median | 116 |
| Q3 | 136 |
| 95-th percentile | 190 |
| Maximum | 510 |
| Range | 478 |
| Interquartile range (IQR) | 28 |
Descriptive statistics
| Standard deviation | 29.34558127 |
|---|---|
| Coefficient of variation (CV) | 0.2317235351 |
| Kurtosis | 5.100646401 |
| Mean | 126.6404867 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 2.044736792 |
| Sum | 1122010144 |
| Variance | 861.16314 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 104 | 824398 | 9.3% |
| 112 | 706514 | 8.0% |
| 102 | 681318 | 7.7% |
| 114 | 586280 | 6.6% |
| 110 | 579524 | 6.5% |
| 106 | 408526 | 4.6% |
| 116 | 404652 | 4.6% |
| 108 | 373868 | 4.2% |
| 118 | 287958 | 3.3% |
| 120 | 249855 | 2.8% |
| Other values (95) | 3756913 |
| Value | Count | Frequency (%) |
| 32 | 1 | < 0.1% |
| 34 | 50 | |
| 50 | 5 | < 0.1% |
| 52 | 20 | < 0.1% |
| 66 | 1 | < 0.1% |
| 68 | 4 | < 0.1% |
| 70 | 6 | < 0.1% |
| 84 | 2 | < 0.1% |
| 86 | 5 | < 0.1% |
| 88 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 510 | 284 | < 0.1% |
| 508 | 20 | < 0.1% |
| 278 | 1 | < 0.1% |
| 276 | 22 | < 0.1% |
| 274 | 45 | < 0.1% |
| 272 | 121 | < 0.1% |
| 270 | 235 | < 0.1% |
| 268 | 473 | < 0.1% |
| 266 | 906 | |
| 264 | 1394 |
| Distinct | 251 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36.5999498 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 3574373 |
| Zeros (%) | 40.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 67.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 39.2 |
| Q3 | 65.6 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 65.6 |
Descriptive statistics
| Standard deviation | 34.89742951 |
|---|---|
| Coefficient of variation (CV) | 0.9534829884 |
| Kurtosis | -1.376522202 |
| Mean | 36.5999498 |
| Median Absolute Deviation (MAD) | 39.2 |
| Skewness | 0.2653131638 |
| Sum | 324268454.8 |
| Variance | 1217.830586 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 3574373 | |
| 100 | 447363 | 5.0% |
| 62.4 | 42550 | 0.5% |
| 64.4 | 42117 | 0.5% |
| 61.6 | 41812 | 0.5% |
| 59.2 | 41640 | 0.5% |
| 62.8 | 41187 | 0.5% |
| 60 | 41009 | 0.5% |
| 58.4 | 40950 | 0.5% |
| 56 | 40880 | 0.5% |
| Other values (241) | 4505925 |
| Value | Count | Frequency (%) |
| 0 | 3574373 | |
| 0.4 | 3662 | < 0.1% |
| 0.8 | 3613 | < 0.1% |
| 1.2 | 3818 | < 0.1% |
| 1.6 | 3876 | < 0.1% |
| 2 | 3453 | < 0.1% |
| 2.4 | 3560 | < 0.1% |
| 2.8 | 4027 | < 0.1% |
| 3.2 | 3828 | < 0.1% |
| 3.6 | 3685 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 447363 | |
| 99.6 | 9811 | 0.1% |
| 99.2 | 9818 | 0.1% |
| 98.8 | 9179 | 0.1% |
| 98.4 | 10237 | 0.1% |
| 98 | 9496 | 0.1% |
| 97.6 | 9937 | 0.1% |
| 97.2 | 10503 | 0.1% |
| 96.8 | 9757 | 0.1% |
| 96.4 | 11112 | 0.1% |
| Distinct | 1077 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.2871541 |
| Minimum | 0 |
|---|---|
| Maximum | 255.97971 |
| Zeros | 1260521 |
| Zeros (%) | 14.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 67.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 16.596594 |
| median | 39.19671 |
| Q3 | 56.69559 |
| 95-th percentile | 75.694374 |
| Maximum | 255.97971 |
| Range | 255.97971 |
| Interquartile range (IQR) | 40.098996 |
Descriptive statistics
| Standard deviation | 24.7398133 |
|---|---|
| Coefficient of variation (CV) | 0.6634942757 |
| Kurtosis | -0.7112892788 |
| Mean | 37.2871541 |
| Median Absolute Deviation (MAD) | 19.99872 |
| Skewness | 0.02613198836 |
| Sum | 330356951.6 |
| Variance | 612.0583623 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 1260521 | 14.2% |
| 48.996864 | 16579 | 0.2% |
| 48.196134 | 15414 | 0.2% |
| 47.996928 | 15392 | 0.2% |
| 47.793816 | 15347 | 0.2% |
| 48.496896 | 15314 | 0.2% |
| 46.497024 | 14991 | 0.2% |
| 50.09445 | 14919 | 0.2% |
| 47.094642 | 14893 | 0.2% |
| 47.59461 | 14844 | 0.2% |
| Other values (1067) | 7461592 |
| Value | Count | Frequency (%) |
| 0 | 1260521 | |
| 0.999936 | 1811 | < 0.1% |
| 1.097586 | 2344 | < 0.1% |
| 1.199142 | 2526 | < 0.1% |
| 1.296792 | 2888 | < 0.1% |
| 1.398348 | 2984 | < 0.1% |
| 1.499904 | 3240 | < 0.1% |
| 1.597554 | 4561 | 0.1% |
| 1.69911 | 3343 | < 0.1% |
| 1.79676 | 3549 | < 0.1% |
| Value | Count | Frequency (%) |
| 255.97971 | 274 | |
| 255.975804 | 273 | |
| 114.590322 | 1 | < 0.1% |
| 114.391116 | 2 | < 0.1% |
| 114.28956 | 2 | < 0.1% |
| 114.090354 | 1 | < 0.1% |
| 113.992704 | 1 | < 0.1% |
| 113.891148 | 3 | < 0.1% |
| 113.789592 | 1 | < 0.1% |
| 113.691942 | 2 | < 0.1% |
| Distinct | 237 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.194278069 |
| Minimum | 0 |
|---|---|
| Maximum | 97.6 |
| Zeros | 7210811 |
| Zeros (%) | 81.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 67.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 20.8 |
| Maximum | 97.6 |
| Range | 97.6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 7.399784608 |
|---|---|
| Coefficient of variation (CV) | 2.316574966 |
| Kurtosis | 5.564438896 |
| Mean | 3.194278069 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.33313955 |
| Sum | 28300684 |
| Variance | 54.75681224 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 7210811 | |
| 16 | 86831 | 1.0% |
| 15.6 | 81559 | 0.9% |
| 16.4 | 79779 | 0.9% |
| 17.2 | 66932 | 0.8% |
| 16.8 | 62000 | 0.7% |
| 15.2 | 56222 | 0.6% |
| 17.6 | 52422 | 0.6% |
| 18 | 42582 | 0.5% |
| 14.8 | 38975 | 0.4% |
| Other values (227) | 1081693 | 12.2% |
| Value | Count | Frequency (%) |
| 0 | 7210811 | |
| 0.4 | 26871 | 0.3% |
| 0.8 | 12871 | 0.1% |
| 1.2 | 9874 | 0.1% |
| 1.6 | 7565 | 0.1% |
| 2 | 8157 | 0.1% |
| 2.4 | 8523 | 0.1% |
| 2.8 | 7434 | 0.1% |
| 3.2 | 7268 | 0.1% |
| 3.6 | 7045 | 0.1% |
| Value | Count | Frequency (%) |
| 97.6 | 2 | < 0.1% |
| 97.2 | 194 | |
| 96.8 | 13 | < 0.1% |
| 96.4 | 44 | < 0.1% |
| 96 | 1 | < 0.1% |
| 94.4 | 2 | < 0.1% |
| 94 | 5 | < 0.1% |
| 93.6 | 7 | < 0.1% |
| 92.8 | 1 | < 0.1% |
| 92.4 | 2 | < 0.1% |
Auto
The auto setting is an easily interpretable pairwise column metric of the following mapping: vartype-vartype : method, categorical-categorical : Cramer's V, numerical-categorical : Cramer's V (using a discretized numerical column), numerical-numerical : Spearman's ρ. This configuration uses the best suitable for each pair of columns.Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| Timestamp | WetTankAirPressure | LongitudAcc | EngineSpeed | Fuel Rate | Engine Load | Boost Pressure | EngineAirInletPressure | AcceleratorPedalPos | VehicleSpeed | BrakePedalPos | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 4.757909e+10 | 10.34250 | 0.2 | 737.375 | 7.511669 | 35.0 | 0.043090 | 106.0 | 44.0 | 6.198822 | 0.0 |
| 1 | 4.757910e+10 | 10.34250 | 0.7 | 965.875 | 12.539164 | 32.5 | 0.051708 | 106.0 | 50.4 | 8.296344 | 0.0 |
| 2 | 4.757910e+10 | 10.34250 | 0.6 | 1253.125 | 17.921541 | 41.5 | 0.086180 | 122.0 | 67.2 | 10.796184 | 0.0 |
| 3 | 4.757910e+10 | 10.34250 | 0.6 | 1586.250 | 27.621649 | 51.0 | 0.232686 | 134.0 | 75.2 | 13.795992 | 0.0 |
| 4 | 4.757910e+10 | 10.34250 | -0.1 | 1208.500 | 0.000000 | 0.0 | 0.422282 | 132.0 | 83.2 | 14.096754 | 0.0 |
| 5 | 4.757910e+10 | 10.34250 | 1.3 | 1297.375 | 34.837583 | 67.5 | 0.310248 | 154.0 | 82.8 | 17.795736 | 0.0 |
| 6 | 4.757910e+10 | 10.34250 | 0.4 | 1523.500 | 27.207620 | 34.0 | 0.560170 | 140.0 | 82.8 | 21.998592 | 0.0 |
| 7 | 4.757910e+10 | 10.34250 | 0.1 | 1260.750 | 14.372721 | 30.5 | 0.293012 | 136.0 | 84.0 | 22.498560 | 0.0 |
| 8 | 4.757910e+10 | 10.27355 | 1.1 | 1383.250 | 11.178783 | 23.0 | 0.577406 | 158.0 | 83.6 | 24.498432 | 0.0 |
| 9 | 4.757910e+10 | 10.27355 | 0.9 | 1245.375 | 39.451049 | 80.0 | 0.361956 | 136.0 | 81.6 | 28.595826 | 0.0 |
Last rows
| Timestamp | WetTankAirPressure | LongitudAcc | EngineSpeed | Fuel Rate | Engine Load | Boost Pressure | EngineAirInletPressure | AcceleratorPedalPos | VehicleSpeed | BrakePedalPos | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 8859796 | 1.114190e+11 | 11.72150 | -0.2 | 1332.750 | 0.000000 | 0.0 | 0.560170 | 140.0 | 78.8 | 27.295128 | 0.0 |
| 8859797 | 1.114190e+11 | 11.85940 | 0.5 | 1155.250 | 32.057674 | 68.0 | 0.318866 | 144.0 | 74.4 | 26.396748 | 0.0 |
| 8859798 | 1.114190e+11 | 11.92835 | 0.5 | 1252.000 | 31.702792 | 64.5 | 0.387810 | 150.0 | 73.2 | 28.998144 | 0.0 |
| 8859799 | 1.114190e+11 | 11.99730 | 0.4 | 1336.750 | 27.621649 | 51.0 | 0.456754 | 146.0 | 69.6 | 31.197222 | 0.0 |
| 8859800 | 1.114190e+11 | 12.13520 | 0.0 | 1375.625 | 20.642303 | 37.5 | 0.430900 | 138.0 | 64.0 | 32.197158 | 0.0 |
| 8859801 | 1.114190e+11 | 12.13520 | 0.0 | 1395.125 | 15.496514 | 26.5 | 0.353338 | 130.0 | 53.2 | 32.997888 | 0.0 |
| 8859802 | 1.114190e+11 | 12.06625 | -0.1 | 1059.875 | 0.000000 | 1.0 | 0.163742 | 118.0 | 38.0 | 32.595570 | 0.0 |
| 8859803 | 1.114190e+11 | 12.06625 | -0.4 | 1036.750 | 0.000000 | 0.0 | 0.094798 | 112.0 | 0.0 | 31.794840 | 0.0 |
| 8859804 | 1.114190e+11 | 11.99730 | -0.2 | 998.125 | 0.000000 | 0.0 | 0.086180 | 112.0 | 0.0 | 30.595698 | 3.2 |
| 8859805 | 1.114190e+11 | 11.99730 | -0.6 | 933.125 | 0.000000 | 0.0 | 0.068944 | 110.0 | 0.0 | 28.795032 | 18.4 |